(Un)expected behavior of digital search tree profile

Authors

  • Michael Drmota
  • Wojciech Szpankowski
Abstract

A digital search tree (DST), one of the most fundamental data structures on words, is a digital tree in which keys (strings, words) are stored directly in (internal) nodes. Such trees find myriad applications, from the popular Lempel-Ziv'78 data compression scheme to distributed hash tables. The profile of a DST measures the number of nodes at the same distance from the root; it is a function of the number of stored strings and the distance from the root. Most parameters of a DST (e.g., height, fill-up) can be expressed in terms of the profile. However, since the inception of the DST, the analysis of the profile has been elusive, and it has become a prominent open problem in the analysis of algorithms. We make here the first, but decisive, step towards solving this problem. We present a precise analysis of the average profile when the stored strings are generated by a biased memoryless source. The main technical difficulty in analyzing the profile lies in solving a sophisticated recurrence equation. We present such a solution for the Poissonized version of the problem (i.e., when the number of stored strings is generated by a Poisson distribution) in the Mellin transform domain. To accomplish this, we introduce a novel functional operator that allows us to express the solution in an explicit form, and then use tools of analytic algorithmics to extract the asymptotic behavior of the profile. This analysis is surprisingly demanding, but once it is carried out it reveals unusually intriguing behavior. The average profile undergoes several phase transitions when moving from the root to the longest path. At first it resembles a full tree, until it abruptly starts growing polynomially, and it oscillates in this range. Our results are derived by methods of analytic algorithmics such as generating functions, the Mellin transform, Poissonization and de-Poissonization, the saddle point method, singularity analysis, and uniform asymptotic analysis.

The work of this author was supported in part by the Austrian Science Foundation FWF Grant No. S9604, and by the NSF Grants CCF-0513636, DMS-0503742, CCF-0830140, and DMS-0800568, NIH Grant R01 GM068959-01, NSA Grant H98230-081-0092, and the AFOSR Grant FA8655-08-1-3018. This work was completed during a visit at Hewlett-Packard Laboratories, Palo Alto, CA.

Affiliations: Inst. Discrete Mathematics and Geometry, TU Wien, A-1040 Wien, Austria; Department of Computer Science, Purdue University, West Lafayette, IN 47907-2066, U.S.A.
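
To make the definitions in the abstract concrete, the following is a minimal sketch of a binary DST and its profile. It is an illustration only, not the authors' method: the names `Node`, `insert`, `build_profile`, and `random_keys` are hypothetical, keys are assumed to be bit strings long enough that an insertion never runs out of bits, and the biased memoryless source is modeled by emitting each bit independently as '1' with probability `p`.

```python
import random
from collections import Counter


class Node:
    """A DST node: the key is stored directly in the (internal) node."""
    __slots__ = ("key", "children")

    def __init__(self, key):
        self.key = key
        self.children = [None, None]  # subtree for next bit 0 / next bit 1


def insert(root, key):
    """Insert a bit string; return (root, depth at which the key was stored)."""
    if root is None:
        return Node(key), 0
    node, depth = root, 0
    while True:
        bit = int(key[depth])          # branch on the bit at the current depth
        child = node.children[bit]
        if child is None:
            node.children[bit] = Node(key)
            return root, depth + 1
        node, depth = child, depth + 1


def build_profile(keys):
    """Build a DST from keys; return a list whose entry k counts nodes at depth k."""
    root = None
    counts = Counter()
    for key in keys:
        root, depth = insert(root, key)
        counts[depth] += 1
    return [counts[k] for k in range(max(counts) + 1)] if counts else []


def random_keys(n, length, p, seed=0):
    """Assumed source model: n strings of independent bits, P(bit = '1') = p."""
    rng = random.Random(seed)
    return ["".join("1" if rng.random() < p else "0" for _ in range(length))
            for _ in range(n)]


if __name__ == "__main__":
    # Small deterministic example: one node at depth 0, two at depth 1, one at depth 2.
    print(build_profile(["000", "100", "010", "110"]))
    # Simulated profile for 1000 keys from a biased source (p = 0.3 is arbitrary).
    print(build_profile(random_keys(1000, 1024, 0.3)))
```

Averaging the simulated profile over many seeds gives an empirical estimate of the average profile studied in the paper; the sketch makes no attempt to reproduce the asymptotic analysis itself.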

Similar resources

Probabilistic analysis of the asymmetric digital search trees

In this paper, by applying three functional operators, the previous results on the (Poisson) variance of the external profile in digital search trees are improved. We study the profile built over $n$ binary strings generated by a memoryless source with unequal probabilities of symbols and use a combinatorial approach for studying the Poissonized variance, since the probability distribution o...

The Expected Profile of Digital Search Trees (December 10, 2009)

A digital search tree (DST) is a fundamental data structure on words that finds myriad applications, from the popular Lempel-Ziv'78 data compression scheme to distributed hash tables. It is a digital tree in which strings (keys, words) are stored directly in (internal) nodes. The profile of a DST measures the number of nodes at the same distance from the root; it is a function of the number o...

The expected profile of digital search trees

A digital search tree (DST) is a fundamental data structure on words that finds various applications from the popular Lempel-Ziv’78 data compression scheme to distributed hash tables. The profile of a DST measures the number of nodes at the same distance from the root; it depends on the number of stored strings and the distance from the root. Most parameters of DST (e.g., depth, height, fillup)...

The Expected Profile of Digital Search Trees (March 24, 2011)

A digital search tree (DST) is a fundamental data structure on words that finds various applications from the popular Lempel-Ziv’78 data compression scheme to distributed hash tables. The profile of a DST measures the number of nodes at the same distance from the root; it depends on the number of stored strings and the distance from the root. Most parameters of DST (e.g., depth, height, fillup)...

The Variance of The Profile in Digital Search Trees

Author(s): R. Kazemi and M. Q. Vahidi-Asl

Abstract: What today we call digital search tree (DST) is Coffman and Eve's sequence tree introduced in 1970. A digital search tree is a binary tree whose ordering of nodes is based on the values of bits in the binary representation of a node's key. In fact, a digital search tree is a digital tree in which strings (keys, words) are stored directly in in...

Publication date: 2009